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BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention relates to providing access to and accessing a database 
and more particularly, providing access to and accessing an attribute based database, 
such as a catalog, using a hierarchical interface, such as a network file system type 
interface. 

Description of the Related Art 

In modern computer environments, several computer systems are typically 
connected together to facilitate the sharing of data and files. The interconnected 
computers, including the hardware and software used to connect them, are known as a 
network. A network can include any number of computer systems connected 
together. This connection can be as simple as wires between two computer systems 
or can be more elaborate, such as a local area network (LAN), a wide area network 
(WAN), or the Internet. 

Each computer system connected in the network can have a different operating 
system ("OS"). To communicate, the computer systems use standardized procedure 
primitives for exchanging data over the network, such as Remote Procedure Calls 
("RPCs"). To access the contents of another computer system, the requesting 
computer system, for example, converts a request into a common RPC protocol, such 
as the well-known Network File System ("NFS") protocol developed by Sun 
Microsystems, Inc. The NFS protocol was designed to allow different operating 
systems to share files across networks and allow computers to access files over a 
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network as if they were in local memory. The NFS protocol is compatible across 
different machines, operating systems, network architectures, and transport protocols. 

NFS represents data in a file system as a directory tree structure wherein each 
branch is a directory containing more directories and files. Branches terminate at 
5 directories that contain no sub-directories. The directory structure is represented as a 
string of directory names representing each level followed by a file name. The typical 
nomenclature used is: 

/f irst_level_directory/second_level_directory/f ile_A 

The directory structure can contain any number of directory levels and any 
10 number of files. Files can be found at any level of the directory tree structure. The 
first level directory is typically known as the "root directory." NFS is a common file 
system abstraction which is recognized by many software applications including word 
processing programs, email utilities, internet applications, and user specific 
applications. 

15 A software application that requests data is called a client. The computer 

system or application that provides data to a client is known as a server. A server can 
be, for example, known as a "file server" or a "database server" if the server controls 
access to a file system or a database, respectively. A computer system, which 
includes software applications executable thereby, can be both a client and a server, 

20 depending on the current activity, that is, requesting or providing data. The requested 
data can be local to the computer system or located on a different computer system 
connected via the network. 

When requesting data from a file system, a client will typically search through 
several directories, beginning at the root directory, searching for a particular file or set 

25 of files. This is accomplished using the NFS protocol, for example, when a client 
issues a "Read From Directory" procedure. The server responds to the client's 
request using the NFS protocol by returning the directory entries. The directory 
entries can be files and/or sub-directories. To search further into the directory tree 
structure, the client issues another "Read From Directory" procedure, identifying a 

30 lower level branch to be searched, one directory at a time. The NFS protocol has 
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many distinct procedures available to perform many different functions. Overall, 
NFS communicates in terms of directories and files. The data is provided as 
described above in the form of a directory tree. After requesting and receiving the 
contents of a directory, further requests from the client can begin at that directory 
5 level. The client does not need to start from the root directory for each subsequent 
request. The NFS protocol allows the client to search different levels of the directory 
tree through the use of file handles, described herein. 

When dealing with a large number of files, the files are often organized in a 
database, such as a catalog, rather than in a directory structure. If the files are 

1 0 organized in a directory structure, all files can be contained in one directory or 

organized into several directories. If contained in one directory, a NFS request to 
"Read From Directory" would return a long list of files which the client would have 
to search in order to find a particular file or set of files. If organized in a directory 
tree structure, the client may have to search through many different directories before 

1 5 finding the file or set of files needed. 

By organizing the files in a database, a client can request files according to file 
characteristics or attributes. Databases respond to data requests according to attribute 
values. This provides the maximum flexibility in searching and organizing the 
information. For example, data for an Internet based product catalog can include a 

20 large number of files, including a graphics file, a product description file, and a cost 
structure file for each product. The files can be organized according to various 
attributes including manufacturer, color, item size, season, price, type, etc. A client 
can structure a request specifying any combination of the various attributes to return a 
particular file or set of files, such as requesting all graphics files associated with a 

25 certain manufacturer or only files for the current season of a certain color. By 
organizing data in a database, a server can provide flexible queries and reports. 

To request data from a database, a query indicating the characteristics or 
attributes of the requested data must be formulated. SQL (Structured Query 
Language) is a typical database search language. SQL is a set-oriented language 
30 consisting of highly flexible commands that can be used to manipulate information 

collected in tables such as in a database. SQL was created as a language for databases 
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that adhere to the relational model. The relational model calls for a clear separation of 
the physical aspects of data from their logical representation. The model removes the 
details of how the data is stored and makes the access to data purely logical. For 
example, a file is specified by attributes and not physical size or location in a file 
5 system. Using SQL statements, the query specifies the tables, columns and row 
qualifiers to get to any data item. Many vendors offer SQL products on PCs, 
superminis, and mainframes. SQL has become the dominant database language of 
mainframes, minicomputers, and LAN servers. 

To access a database, a client must formulate and send a query to the server. 

10 In a SQL query, data has attributes, called classifiers, which are set to specific values. 
Using the product catalog example above, for a particular file the classifier SIZE can 
be set to CHILDRENS, the classifier COLOR can be set to GREEN, the classifier 
CATEGORY can be set to CHAIR, and the classifier TYPE can be set to GRAPHICS 
(SIZE = CHILDRENS, COLOR = GREEN, CATEGORY = CHAIR, TYPE = 

1 5 GRAPHICS). Database application specific software on a client that understands the 
database and the various classifiers formulates the SQL query. The server responds 
with raw data, typically a record set of all opaque data that meets the classifiers set in 
the SQL query. To further define the request, for example to add the classifier 
SEASON = FALL, the client must reformulate the entire SQL query. This method of 

20 accessing a database requires clients that access the database to have application 
specific software that enables them to formulate SQL queries. In addition, a client 
must typically reformulate an entire query to further define a search. 

It is difficult to control access to individual files specified by a database 
differently for each client. Using the product catalog example above, the database 

25 owner might want to allow a specific client to access only a certain set of files, for 
example, those with the classifier VENDOR having a value of COMPANY A. The 
database owner can desire to grant access to the entire database to another client. This 
method of granting different access to different clients is difficult to provide. In a file 
system environment, access rights can be associated with individual files or 

30 directories. However, even in the file system environment, it can be difficult to 
maintain the provision of different access rights to many clients. 



it Docket: T00029 



Attorney Docket No.: M-8696 US 



A simpler method is needed to provide clients access to the contents of a 
database. To provide access to a wide variety of clients, requiring each client to have 
database application specific software to formulate a query is often not practical. In 
addition, data and file protections should be easily controlled on a client-by-client 
5 basis. 

SUMMARY OF THE INVENTION 

In one embodiment of the present invention, a method of exporting data from 
a database is disclosed, wherein the database includes a number of classifiers of data 
and data linked to at least one of the classifiers. The method includes organizing each 
10 of the classifiers into a hierarchical data structure, and organizing the data into the 
hierarchical data structure according to the classifiers to which the data is linked. 

In one embodiment of the present invention, a hierarchical data structure of a 
database is disclosed, wherein the database includes a number of classifiers of data 
and data linked to at least one of the classifiers. The hierarchical data structure is 
1 5 generated by organizing each of the classifiers into a hierarchical data structure and 
organizing the data into the hierarchical data structure according to the classifiers to 
which the data is linked. 

In another embodiment of the present invention, the hierarchical data structure 
is a file system directory tree structure. 

20 In another embodiment of the present invention, the hierarchical data structure 

represents only a subset of the classifiers in the database. 

In another embodiment, the hierarchical data structure is a file system 
directory tree structure according to the NFS protocol. 

In one embodiment of the present invention, a method of providing a 
25 hierarchical interface to an attribute based database is disclosed. The method includes 
receiving a request in NFS protocol, translating the request into a database query, 
producing a result that is formatted according to the NFS protocol and sending the 
result to the client. 
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In another embodiment, the method further includes providing a file handle to 
a client upon an initial access request from the client, the file handle corresponding to 
a view in the database, wherein the view defines an amount of data in the database 
that is observable by the client. 

5 In another embodiment of the present invention, a system for providing a 

hierarchical interface to an attribute based database is disclosed. The system includes 
a data processing system having a memory coupled to at least one processor, wherein 
the memory comprises instructions for enabling the data processing system to receive 
a request in NFS protocol, translate the request into a database query, produce a result 
10 that is formatted according to the NFS protocol, and send the result to the client. 

The foregoing is a summary and thus contains, by necessity, simplifications, 
generalizations and omissions of detail; consequently, those skilled in the art will 
appreciate that the summary is illustrative only and is not intended to be in any way 
limiting. As will also be apparent to one of skill in the art, the operations disclosed 
15 herein may be implemented in a number of ways, and such changes and modifications 
may be made without departing from this invention and its broader aspects. Other 
aspects, inventive features, and advantages of the present invention, as defined solely 
by the claims, will become apparent in the non-limiting detailed description set forth 
below. 

20 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention may be better understood, and its numerous objects, 
features, and advantages made apparent to those skilled in the art by referencing the 
accompanying drawings. 

25 Fig. 1 is a block diagram illustrating a network environment in which 

embodiments of the present invention may be practiced. 

Fig. 2 illustrates the communication link between a client and a server. 
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Fig. 3 illustrates the structure of a file system as a directory tree as represented 
by NFS. 

Fig. 4 illustrates the process a client typically performs to search for data on a 
file server. 

5 Fig. 5 illustrates the process a client typically performs to access a database on 

a database server using an SQL query. 

Fig. 6 illustrates an example representation of a two dimensional database. 

Figs. 7A-7B illustrate example directory tree structures of a database. 

Fig. 8 illustrates the process a client performs to access a database on a server 
1 0 using an NFS representation of the data. 

Fig. 9 illustrates the process a server performs to map an ILocation to a file 

handle. 

Fig. 10 illustrates the process a server performs to translate an ILocation to an 
SQL query. 

15 Fig. 1 1 illustrates a block diagram of a computer system suitable for 

implementing embodiments of the present invention. 

Fig. 12 illustrates a software architecture diagram. 

The use of the same reference symbols in different drawings indicates similar 
or identical items. 

20 DESCRIPTION OF THE PREFERRED EMBODIMENT(S) 

The following is intended to provide a detailed description of an example of 
the invention and should not be taken to be limiting of the invention itself. Rather, 
any number of variations may fall within the scope of the invention that is defined in 
the claims following the description. 
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Embodiments of the present invention allow a client to access a database as if 
the database were a file system, representing the data in a hierarchical structure, such 
as a familiar directory tree structure. Embodiments of the present invention enable a 
client to access a database without special software or knowledge of the database 
organization and classifiers. In addition, embodiments of the present invention allow 
the server to restrict access to specific files on a client-by-client basis. 

Data is often conveniently stored and organized in a database. Databases 
often store information as represented in Fig. 6. Databases often require a large 
amount of memory, and many users often desire access to some or all of the data in a 
database. Additionally, searching a database in a reasonable amount of time often 
requires significant processing power and specialized software. For these reasons and 
others, databases are often stored in a central repository accessible by a server. 
Clients gain access to the database by communicating to such server. The client may 
request data import from the client to the server or data export from the server to the 
client. Users, especially graphical users who interact with a file system such as 
Microsoft Corporation's Windows Explorer file system, are generally familiar with 
the organization of information in a tree like structure. 

Thus, given that users often want access to data in a database and are generally 
familiar with hierarchical presentation structures, embodiments of the presentation 
allow the representation of data and database organizational information via a 
hierarchical database related data structure such as a file system like tree. 

For example, embodiments of the present invention allow users to display the 
data and database organization of Fig. 6 as the hierarchical data structure of Figs. 7A- 
7B. Additionally, embodiments of the present invention allow the user to tailor the 
hierarchical data related data structure to fit the needs of the user. For example, the 
user may only want to access data related to for example, company A and their 
products or all companies but only chairs. Additionally, the computer system 
exporting the hierarchical database related data structure can restrict access to data for 
which the user's credentials allow access. 
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Fig. 1 is a block diagram illustrating a network environment in which 
embodiments of the present invention can be practiced. As is illustrated in Fig. 1, 
network 100, such as a private wide area network (WAN), local area network (LAN), 
or the Internet, allows communication between various computer systems 110-160. 
5 The computer systems 1 10-160 may be clients, servers, or both. For example, 
computer system 110 may be a client with a terminal and user input and output 
devices such as a keyboard and monitor and computer system 130 may be a server for 
the sharing of data amongst some or all computer systems on network 100. Any 
number of computer systems, both clients and servers, can be connected to network 
10 1 00. Additionally, individual networks may be interconnected to form super 
networks. 

Computer systems 1 10-160 can be, for example, a computer system of any 
appropriate design, in general, including a mainframe, a mini -computer or a personal 
computer system. Such a computer system typically includes a system unit having a 

1 5 system processor and associated volatile and non- volatile memory, one or more 

display monitors and keyboards, one or more diskette drives, one or more fixed disk 
storage devices and one or more printers. These computer systems are typically 
information handling systems that are designed to provide computing power to one or 
more users, either locally or remotely. Such a computer system can also include one 

20 or a plurality of I/O devices (i.e. peripheral devices) which are coupled to the system 
processor and which perform specialized functions. Examples of I/O devices include 
modems, sound and video devices and specialized communication devices. Mass 
storage devices such as hard disks, CD-ROM drives and magneto-optical drives can 
also be provided, either as an integrated or peripheral device. One such example 

25 computer system is shown in detail in Fig. 1 1 . 

Fig. 2 illustrates the communication link between a client and a server. Clients 
and servers are separate logical entities that work together on a single computer 
system or between two or more computer systems over a network to accomplish a 
task. Server 210 services a number of clients 230(1 :N) at the same time. Often server 
30 310 performs the task of providing data to clients 230(1 :N) across the network 200. 
The data often resides in one or more databases, such as database 220. "(1 :N)" 
represents 1 through N, and "N" is a variable number. Server 210 regulates clients 
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230(1 :N) access to database 220 and other network resources (not shown). A 
software application can implement a client, a server, or both types of technology. 
Thus, servers are useful, for example, for sharing files across a network and for 
creating shared repositories of documents, images, engineering drawings, and other 
5 large data objects. 

In a typical communication cycle, client 230(1) requests service from server 
210. Client 230(1) typically suspends communication to server 210 until a reply is 
received from server 210. If accessing a file server, client 230(1) passes parameters to 
server 210 using a remote call procedure (RPC) such as NFS. If accessing a database 

10 server, client 230(1) formulates a database query using, for example, SQL. Client 
230(1) formulates a message containing parameters or attribute values and sends the 
message to server 210. Server 210 receives the message, interprets the parameters, 
calls the procedure, and sends the reply back to client 230(1). Clients initiate the 
dialog with a server by requesting a service. Typically, servers passively await 

15 requests from the clients. 

Fig. 3 illustrates the structure of a file system as a directory tree as represented 
by NFS. The NFS protocol allows a file system on one computing device to be 
exported to another computing device over a network. A file system, such as file 
system 300, is represented as a directory tree on a single server (usually a single disk 

20 or physical partition) with a specified root directory 3 1 0. Some operating systems 
provide a mount operation to make all file systems appear as a single tree, while 
others maintain a forest of file systems. NFS assumes a file system is hierarchical 
with each branch of the hierarchy having any number of sub-branches containing files 
and/or directories. A file can be any data structure including application programs, 

25 graphics files, text documents, service programs, databases, etc. Generally, all 

branches include a directory with the bottom level empty or consisting of only one or 
more files. 

Referring to Fig. 3, directory 310 can be a root directory, that is, the top level 
of file system 300 or can be a subdirectory of a higher level directory (not shown). 
30 Directory 310 can contain any number of files and directories. Here, directory 3 1 0 is 
shown with branches and sub-branches to two sub-directories, directory 320 and 
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directory 330. Directory 310 also contains three files, file 325, file 335, and file 345. 
Directory 320 contains two files, file 355 and file 365. File 355 and file 365 are 
considered as the bottom of the tree branch consisting of directory 3 1 0 and directory 
320. Directory 330 contains only one directory, directory 340. Directory 340 
5 contains two files, file 375 and file 385. File 375 and file 385 are considered as the 
bottom of the tree branch consisting of directory 3 1 0, directory 330 and directory 340. 
A directory can be empty, contain only files, or contain only directories. Assuming 
directory 310 is the root directory, the typical nomenclature used to represent file 375 
is 

10 f ile_system_300: /directory_310/directory_330/directory_340/file_375 

In the above nomenclature, the file system 300 is typically assigned an OS specific 
identifier. In DOS or Windows, this is typically a letter, such as "f ' and referred to as 
the "f: drive." In UNIX, this is typically a "mount point." 

A client communicating via NFS parses one component of a pathname at a 
1 5 time. For example, a client can request the contents of Directory 310 using the NFS 
procedure "Read From Directory." The server will respond with a listing of all 
Directory 310 entries, including Directory 320, Directory 330, File 325, File 335, and 
File 345. The server can send additional information, including file type, size, date 
created, etc. The client can then request the contents of Directory 320, and the server 
20 responds with a listing of File 355. 

The NFS protocol does not define the permission checking used by servers. 
However, NFS does provide authentication mechanisms so the server can identify 
which client is making a particular request. Once a client is identified, typically a 
server does normal operating system permission checking. In the "AUTH_UNIX" 
25 model, the server receives the client's UID (User Identification) and GID (Group 

Identification) on each call and uses them to check permissions. The "AUTHJJNIX" 
model is defined in NFS protocol description. If the client does not have permission 
to read directory 310, a request results in either an error message or a listing of no 
files, e.g., as if the directory were empty. 
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Locating servers in a network and further locating the files contained therein 
or accessible thereby is a relatively well-known technique. However, the process is 
described in some detail here for convenience. It will be recognized by those skilled 
in the art that there are many ways for computing devices to access files and utilize 
5 services over a network. 

Fig. 4 illustrates the process a client typically performs to search for data on a 
file server. Steps 41 0 through 440, indicated by box 405, implement the mount 
protocol. The mount protocol provides operating system specific services including 
looking up server path names, validating user identity, and checking access 
10 permissions. The mount protocol allows a server to hand out remote access privileges 
to a restricted set of clients. The mount protocol performs the operating system- 
specific functions that allow, for example, attaching remote directory trees to some 
local file system. 

When the server first starts up, the server typically advertises the logical 
1 5 location, i.e. the network address, and service provided by the server in the network 
directory (step 410). The network directory tracks available services and resources 
and their locations. A client can query the network directory for specific services, 
such as services from available resources like databases, printers, or applications. The 
client requests to access the server (step 420). After authenticating the client and 
20 checking authorizations, the server responds to the client's mount request by returning 
the location of the resource, identified by the root file handle (step 430). The root file 
handle identifies the root directory of the resource. Respective file handles are used 
to identify files and directories on the server's file system. A file handle (also referred 
to as "fhandle") can contain whatever information the server needs to distinguish an 
25 individual file or directory. Clients use the mount protocol to get the root file handle, 
which allows them entry into the file system of the server. The client stores the root 
file handle in a cache to use when accessing the server (step 440). 

A client only needs to mount the server once. By retaining the root file 
handle, the client can repeatedly access the server. Once mounted, the server now 
30 begins a cycle of receiving and servicing requests from the client. 
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The client sends a request to the server, for example, requesting the contents 
of the root directory (step 450). The client sends the root file handle as a parameter of 
the request. The server responds with requested data (step 460). The server also 
sends file handles for the files and directories supplied in the response. The "root file 
5 handle" refers to the location of the root directory on the server and is used to request 
information regarding the root directory. A "file handle" refers to the location of the 
associated file or directory and can refer to any file or directory. File handles, 
whether or not the "root file handle," are used whenever the client requests service 
and identifies the particular directory or file the client is interested in. The client does 

10 not interpret or decode the file handle. The file handle is an identifier used by the 

client to access a particular file or directory. The server assigns file handles according 
to server preferences, such that the file or directory is uniquely identified and is 
decodable by the server. The client stores the file handles, for example in a cache. 
The client does not decode the file handle and only uses the file handle to pass the 

1 5 server as a parameter indicating a particular file or directory. 

The client requests additional data, for example, the contents of a particular 
directory (step 470). The client sends the file handle associated with the particular 
directory. The server responds with new data, that is, the contents of the particular 
directory, files and directories, and the associated file handles (step 480). After the 
20 server has responded with the requested data (step 460 or step 480), the client can 
start a new data request at the root directory level by returning to requesting data by 
sending the root file handle (step 450). In addition, after the server responds with new 
data (step 480), the client can request additional data or search sub-directories (step 
470). 

25 Applying the process of Fig. 4 to the file system 300 of Fig. 3, and assuming 

directory 310 is a root directory, the client begins by requesting to mount the file 
system 300. After authenticating the client and checking authorizations, the server 
responds with the root file handle associated with directory 310. The client stores the 
root file handle and later uses the root file handle to access the contents of directory 

30 310. The client formulates and sends the request and the root file handle to the server. 
If the request was "Read From Directory," for example, the server responds by 
providing a list of directory 320, directory 330, file 325, file 335, file 345, their 
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respective file handles, and any other requested information such as file size, type and 
date created. To formulate and send another request, for example to request the 
contents of directory 320, the client need only formulate a new request and send the 
file handle associated with directory 320. 

Fig. 5 illustrates the process a client typically performs to access a database on 
a database server using an SQL query. Steps 5 10 - 540 are the mount procedure, 
similar to the mount procedure described in Fig. 4. The client application formulates 
an SQL query to search for a file or set of files (step 550). The query is formulated by 
database application specific software, such as the Trilogy Classification Engine 
("TCE") software available from Trilogy, Inc. of Austin, Texas, that understands the 
classifiers of the database. The query formulation can also include user input to set 
values of the classifiers, such as a user requesting to see all files with a particular 
attribute. The client requests data and data-related services (such as sorting and 
filtering) from a database server by sending the SQL query as a message to the 
database server (step 560). A client application can, with a single SQL statement, 
retrieve and modify a set of server database records or files. The client must have 
application code to formulate the query and access the database. 

The server queries the database (step 570). The code that processes the SQL 
request and the data typically resides on the same computer system. The server uses 
local processing power to find the requested data instead of passing all the records 
back to a client and then letting the client find the data. 

The server responds with the requested data. The results of each SQL query 
are returned over the network. The database server, also known as the SQL engine, 
responds to the client's requests and provides secured access to shared data (step 580). 
The SQL database engine can filter the query result sets, resulting in considerable data 
communication savings. 

The client returns to step 550 for any further queries of the database. The 
client formulates each query and searches the entire database contents for each 
request. 
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Fig. 6 illustrates an example representation of a two dimensional database. A 
database administrator or other such person creates database 600, organizing data by 
defining classifiers and assigning values for each datum. A datum identifies a single 
piece of information and can be a file, a record, or any other piece of data. The datum 
5 is referred to as a "blob object," that is, a large chunk of opaque data that the database 
stores but does not interpret. Each row in database 600 represents a different datum 
or piece of information. Each column in database 600 represents a different classifier, 
for example, VENDOR, CATEGORY, COLOR, SIZE, SEASON, and NAME. The 
classifier for each datum may or may not be assigned a value, as shown for Datum E 
1 0 where the classifier SIZE has the value MEDIUM and for Datum F where the 
classifier SIZE has a NULL value. 

The names of classifiers and the values given in database 600 are used for 
example purposes only, and are descriptive here but can be completely arbitrary. The 
database administrator organizes the database according to particular data and client 
15 needs. A database can contain one or more tables, such as the one shown in Fig. 6. 
The number of classifiers and amount of data contained in database 600 are used for 
example purposes only, and can vary widely in number according to the particular 
database. 

As previously discussed, to access data contained in a database, a client 
20 formulates a query. The query requests a listing of all data that have classifiers 
restricted to specific values. For example, a query of database 600 restricting the 
classifiers CATEGORY = TABLE and SIZE = MEDIUM returns a listing including 
Datum A and Datum E. Another query only restricting the classifier SEASON = 
FALL will return the same results, Datum A and Datum E. Since any number of 
25 classifiers can represent the datum, queries on different classifier values can return the 
same datum. As contrasted to a file system, there is only one way to return a specific 
datum, that is, by parsing specific directory levels to the specific file. As another 
example, a query restricting the classifiers CATEGORY = CHAIR and COLOR = 
OAK returns an empty set since no data meets the requested criteria. A query that 
30 does not restrict any classifiers to a specific value returns the entire contents of the 
database. 
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Users are generally familiar with the organization of information into a 
hierarchy, such as a file system. Users also often desire to access data in a database. 
Embodiments of the present invention present data in a database to a user in a 
hierarchical display. Thus, a hierarchical interface to the database and accessing the 
5 database using a hierarchical database increases the usability of the database. For 
example, embodiments of the present invention allow a client to access a database 
such as represented by database 600 as if the database were a file system, representing 
the data in a familiar directory tree structure. The data in database 600, represented as 
files in a directory tree structure, can be represented in a variety of ways; the ordering 

10 of the directories and sub-directories in each branch is variable based on a pre- 
selected ordering of classifiers. Two examples of such directory tree structures of 
database 600 are shown in Figs. 7A-7B. As illustrated in Figs. 7A-7B, the entire list 
of classifiers need not be specified, nor does the specific ordering matter. Fig. 7A 
shows a subset of the data, in particular, only those that have VENDOR = 

1 5 COMPANY A. Fig. 7A shows a larger set of data in a different ordering scheme. 

Each is a valid method of representing a particular datum. For example, depending on 
the specified ordering, Datum A can be represented by any of the following 
nomenclatures: 

/company_a/ table/medium/ da tum_a 
20 /table/medium/company_a/datum_a 
/f all/da tum_a 

Embodiments of the present invention enable a client to access a database 
without the client having special software or knowledge of the database organization 
and classifiers. According to embodiments of the present invention, a client can 
25 search a database utilizing, for example, the standard NFS protocol. The client need 
not be aware that the request involves accessing a database nor is there a need for the 
client to have application specific software to manipulate the database structure or 
assign any classifiers. 

In addition, embodiments of the present invention allow the database server to 
30 restrict access to specific files on a client-by-client basis. Thus, data provided from a 
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database using a hierarchical database interface can be provided with improved 
granularity or precision. This is achieved by providing what is referred to as a "view" 
for each client. A view is a created environment that defines what data a client sees 
and how that subset of data is presented to the user. Views restrict some classifiers to 
5 a specific value and set an order of viewing classifiers. A classifier restricted to a 
specific value is referred to as a "bound" classifier. A view with a bound classifier 
presents only data that meets that restricted value. A classifier not restricted to a 
specific value is a referred to as an "unbound" classifier. Unbound classifiers in a 
view set the order of viewing classifiers, presented as a defined order of sub- 
1 0 directories. A view can consist of any number of bound and unbound classifiers. 

View assignment to a particular client can be stored in a cache, table, database or can 
be assigned at mount time according to various client characteristics. 

A view can be represented by the generic formula given below: 

VIEWl: Classifier (l)=Valuel (x) , Classifier <2) =Value2 (y) , ... , 
15 Classifier (R) =ValueR(z) , Classifier (S) , Classifier (S+l) , 
... , Classifier (T) 

In View 1, above, classifiers(l.-R) are bound, and classifiers(S:T) are unbound. 
The ordering of bound classifiers in a view is not typically important. The only data 
viewed has classifier values set to Value l(x), Value2(y), . . ., ValueR(z). A view can 
20 be extremely restrictive, for example, a client can only observe a small portion of the 
database, or the view can contain no bound classifiers, allowing the client to observe 
the entire database. The more classifiers that are bound the more specific the files that 
are in the response listing. 

The ordering of unbound classifiers in a view is typically important. After 
25 filtering the data for the bound classifiers, the remaining data is presented in a 
directory tree structure according to the ordering of the unbound classifiers. For 
example, in View 1, above, Classifier(S) would represent the first level of directories, 
Classifier(S+l) yvould represent the second level of directories (a sub-directory of 
Classifier(S)), down to the last directory level, represented by Classifier(T). The 
30 Classifier would preferably not be the directory, such as TYPE from the database in 
Table 1 . Instead, the directories would be listed as the available values of the 

- 17- 

Client Docket: T00029 



Attorney Docket No.: M-8696 US 



classifier, such as TABLE or CHAIR. The directories listed are distinct, that is, if 
there are more than one occurrence of a specific value for the classifier, the value is 
only listed once. The client is able to parse the directories defined by the unbound 
classifiers, seeing only the data defined by the bound classifiers. A view without 
5 unbound classifiers is flat and contains no subdirectories. The data is presented as a 
large list of all files that meet the restricted values of the bound classifiers. If a value 
of a classifier for a datum is not set or has a NULL value, the datum appears as a file 
in the directory corresponding to the unbound classifier. 

The view for Fig. 7A contains a bound classifier and unbound classifiers. The 
1 0 view for Fig. 7 A can be represented as follows: 

VIEWAi VENDOR=COMPANYA, CATEGORY, COLOR 

The view for Fig. 7B contains only unbound classifiers. The view for Fig. 7B 
can be represented as follows: 

VIEWBt CATEGORY, SIZE 

1 5 Embodiments of the present invention allow data to be presented in a 

rearrange-able order based on the particular view and classifiers, bound and unbound. 
By preventing the client from observing data not meeting bound values, access to files 
can be restricted via the client view. Views can be assigned on a client-by-client 
basis, using bound classifiers to restrict access to certain data, allowing access 

20 protections to be client specific. Unbound classifiers are used to provide for ease of 
parsing the data available in the database. 

Fig. 8 illustrates the process a client performs to access a database on a server 
using an NFS representation of the data. The process from a client's perspective is 
similar to the process of Fig. 4, the process a client performs to search for data on a 
25 file server. 

Steps 810 - 840 represent the mount protocol. According to the present 
invention, the mount protocol sets up the client view. When the server first starts up, 
the server advertises the location and service provided by the server in the network 
directory (step 810). The client requests to access the server (step 820). After 
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authenticating the client and checking authorizations, the server responds to the 
client's mount request by returning a view file handle (step 830), which the client 
stores in a cache (step 840). Views are assigned to clients based on parameters 
configured by the database administrator. The view, in addition to restricting a 
5 client's access to certain data, also sets up the directory structure the client will see. 
The view file handle is similar to the root file handle in that the client uses the view 
file handle to access the database on the server and identifies the starting point for 
searching for data. However, clients can be given a different file handle 
corresponding to their particular view. In addition, rather than being mapped to the 
10 location of the root directory, the view file handle corresponds to an ILocation in the 
database for the client specific view. 

An ILocation is a temporary marker that indicates where in a view a client is 
currently working. A view is a special kind of ILocation. Views are named 
Ilocations, which indicate the starting location in a database for the client. Every 

15 directory the client sees is a separate ILocation. Note that files do not have associated 
ILocations. ILocations can be represented similar to views, as a set of zero or more 
bound classifiers and unbound classifiers. The server maintains an ILocation table, a 
mapping between ILocations and the file handles sent and received from the client. A 
client only needs to mount the server once. By retaining the view file handle, the 

20 client can repeatedly access the server. Because the view and the directory structure 
have been defined by the database administrator, when the client passes the view file 
handle to the server, many file protections are already defined. 

The client sends a request to the server, for example, requesting the contents 
of a directory (step 850). The client sends the view file handle as a parameter of the 
25 request. The client need not be aware that the data request involves searching a 
database. The client formulates all requests according to the NFS protocol. The 
client receives all responses in the NFS protocol wherein the data is represented as a 
directory tree structure. 

The server looks up the client view file handle in an ILocation table 
30 maintained by the server (step 855). The mapping from ILocation to file handle is 
important since all NFS clients understand file handles, but a database does not. A 
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database understands SQL, which is easily converted to and from an ILocation. Any 
method of uniquely assigning a mapping from file handle to ILocation, such as a table 
lookup, is acceptable. As previously mentioned, a file handle is for server use to 
identify a location, which is given to the client who stores and returns the file handle 
5 with requests, but does not interpret the file handle. 

Fig. 9 illustrates the process a server performs to map an ILocation to a file 
handle. The mapping from ILocation to file handle is accomplished with the use of 
cryptographic hash. The server changes the internal representation of an ILocation 
into a single byte array (string of Is and Os) (step 910). Next, the byte is 

10 cryptographically hashed (step 920). This yields a small byte array, which is then 
padded with bits, such as Os, to make the size of the byte array the size expected for a 
file handle (step 930). The mapping from ILocation to file handle is stored in a 
ILocation table maintained by the array (step 940). Files do not have associated 
ILocations and the server can assign file handles according to server preferences, such 

15 that the file is uniquely identified and is decodable by the server. 

Cryptographic hash functions are used in various contexts, for example to 
compute the message digest when making a digital signature. A hash function 
compresses the bits of a message to a fixed-size hash value in a way that distributes 
the possible messages evenly among the possible hash values. A cryptographic hash 

20 function does this in a way that producing a message that would hash to a particular 
hash value is difficult. The method of forming a cryptographic hash is well known in 
the art and is mentioned here as a preferred method of creating a unique mapping 
from the ILocation to a file handle. The mapping from file handle to ILocation is kept 
in a table or a database. A table is the preferred method of storing the mappings since 

25 they do not need to be retained for long periods of time. 

The server formulates an SQL query from the ILocation (step 860). Fig. 10 
illustrates the process a server performs to translate an ILocation to an SQL query. To 
formulate an SQL query, the server must first translate the ILocation to SQL 
parameter mapping. 
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Referring to Fig. 10, the server determines if there are bound classifiers (step 
1010). The bound classifiers determine the data that the client observes. If there are 
no bound classifiers, the set of data that the client observes includes the entire 
database and the server continues to step 1030. 

5 If there are bound classifiers, a WHERE clause in SQL is created for each 

bound classifier in the ILocation (step 1020). The WHERE keyword in SQL is used 
to conditionally select data from a table. For the bound classifier CLASSBFIERl = 
VALUE 1, the WHERE clause contains the condition CLASSMER1 = VALUE 1. 
Multiple bound classifiers cause multiple constraints in an SQL query. After step 
10 1020, the server continues to step 1030. 

The server determines if there are unbound classifiers in the ILocation (step 
1030). The unbound classifiers define the directory structure of the set of data that the 
client observes (as defined by the bound classifiers). If there are no unbound 
classifiers in the ILocation, there are no defined subdirectories in the directory 
1 5 structure, and the server continues to step 1060. 

If the ILocation contains unbound classifiers, the subdirectories are defined by 
the ordering of the unbound classifiers (step 1040). The set of data that the client 
observes is examined for values of the first unbound classifier by adding a SELECT 
DISTINCT clause to the query for the classifier. In other words, the clause searches 

20 the observable data in the database for a listing of values for the unbound classifier 
that are defined for some data. The SELECT keyword in SQL produces a result of all 
available values of the classifier (an available value means that at least one datum in 
the set has the value). The DISTINCT keyword in SQL, when used with a SELECT 
clause, eliminates duplicate values in the produced result. The result returned by the 

25 database will be represented as a listing of all subdirectories at the level in the 
directory structure corresponding to the ILocation. 

After step 1040, the server goes on to step 1050. The server must find all 
terminal (non-directory) files (step 1050). This is accomplished by setting the 
condition of a WHERE clause for the first unbound classifier to NULL. The result 
30 returned by the database will be represented as a listing of all files not represented by 
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the subdirectories resulting from step 1040. After step 1050, the server has completed 
the formulation of the query, step 1070. 

If the ILocation contains no unbound classifiers, there are no subdirectories in 
the directory tree, and the result will produce a listing a files or data that is observable 
5 by the client. The keyword SELECT would be used to produce a listing of the names 
of the files (step 1060). After step 1060, the server has completed the formulation of 
the query, step 1070. Once the ILocation is translated to an SQL query, the server 
queries the database. 

Referring back to Fig. 8, the server maps the new ILocations corresponding to 
1 0 the directories in the response to file handles (step 865). The server responds to the 
client with data that is organized into a directory tree structure according to NFS (step 
870). The data includes the associated file handles for the files and directories 
supplied in the response. The directories listed in the response correspond to the first 
unbound classifier values. The files listed in the response are the data that have the 
1 5 values specified by the bound classifiers that have a NULL value for the first unbound 
classifier or, if the are no unbound classifiers, the files listed are the entire set of 
observable data as defined by the bound classifiers. 

To view the contents of a directory listed in the response, in step 875 the client 
requests further data by sending the file handle of the desired directory. In step 880, 
20 the server looks up the file handle in the ILocation table. In step 885, the server 

formulates an SQL query using the method described above and queries the database. 
After querying the database, in step 890 the server maps the new ILocation(s) to files 
handle(s). In Step 895, the server responds to the client with data and file handles for 
the associated files and directories. 

25 After step 895, the client can request to search further into the directory tree 

structure by returning to step 875 and requesting further data. After step 870 or step 
895, the client may not desire to parse the directory tree structure further and will 
return to step 850. Step 850 returns the client to requesting data at the top level of a 
personalized directory tree structure according to the client's view. 
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There are many advantages to providing a hierarchical interface, such as a 
directory tree structure, for a database. Many clients already understand file systems 
and do not need special database software to formulate an SQL query. The directory 
tree structure allows many applications to present data in a familiar way to many 
5 clients. Many different tools are able to access a database without having database 
specific software. The database is reduced to flexible file system. The present 
invention, in at least one embodiment, creates an NFS interface with a rearrangeable 
hierarchy, creating a virtual file system based on a database. In addition, due to the 
individual client views, the present invention, in at least one embodiment, utilizes 
1 0 NFS to mount whatever file structure the database administrator selects for a 

particular client. Embodiments of the present invention enable flexible restriction and 
presentation of data. 

The present invention can be implemented on a computer system such as a 
general-purpose computer 1 1 00 illustrated in Fig. 1 1 . Input user device(s) 1110, such 

15 as a keyboard and/or mouse, are coupled to a bi-directional system bus 1118. The 
input user device(s) 1 1 10 are for introducing user input to the computer system and 
communicating that user input to processor 1113. The computer system of Fig. 1 1 
also includes a video memory 1114, main memory 1115 and mass storage 1112, all 
coupled to bi-directional system bus 1118 along with input user device(s) 1110 and 

20 processor 1113. The mass storage 1112 may include both fixed and removable 

media, such as other available mass storage technology. Bus 1118 may contain, for 
example, 32 address lines for addressing video memory 1 1 14 or main memory 1115. 
The system bus 1118 also includes, for example, a 32-bit data bus for transferring data 
between and among the components, such as processor 1113, main memory 1115, 

25 video memory 1114 and mass storage 1112. Alternatively, multiplex data/address 
lines may be used instead of separate data and address lines. 

Computer programs and data are generally stored as instructions and data in 
mass storage 1112 until loaded into main memory 1 1 15 for execution. Computer 
programs may also be in the form of electronic signals modulated in accordance with 
30 the computer program and data communication technology when transferred via a 
network. The method and functions relating to the present invention may be 
implemented in a computer program alone or in conjunction with other configuration 
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technology. Furthermore, context subsystem data structures can be implemented in 
general-purpose computer 1 100 and utilized by general-purpose computer 1 100 or by 
other data processing systems that have access to the data structures. 

In one embodiment of this invention, the processor 1 1 13 is a 32-bit 
5 microprocessor manufactured by Motorola, such as the 680X0 processor or 

microprocessor manufactured by Intel, such as the 80X86, or Pentium processor. 
However, any other suitable microprocessor or microcomputer may be utilized. Main 
memory 1 1 1 5 is comprised of dynamic random access memory (DRAM). Video 
memory 1 1 14 is a dual-ported video random access memory. One port of the video 
10 memory 1114 is coupled to video amplifier 1116. The video amplifier 1 116 is used to 
drive the display 1117. Video amplifier 1 1 16 is well known in the art and may be 
implemented by any suitable means. This circuitry converts pixel data stored in video 
memory 1 1 14 to a raster signal suitable for use by display 1117. Display 1 1 17 is a 
type of monitor suitable for displaying graphic images. 

15 The computer system described above is for purposes of example only. The 

present invention may be implemented in any type of computer system or 
programming or processing environment. It is contemplated that the present 
invention might be run on a stand-alone computer system, such as the one described 
above. The present invention might also be run from a server system that can be 

20 accessed by a plurality of client computer systems interconnected over an intranet 
network. Finally, the present invention may be run from a server that is accessible to 
clients over the Internet. 

Fig. 12 illustrates a software architecture diagram. A hierarchical interface 
layer 1210 receives requests from a client. Hierarchical interface layer 1210 accesses 

25 a database query engine layer 1220. Database query engine layer 1220 accesses 
database 1230. Database 1230 returns data. 

Hierarchical interface layer 1210 typically performs processes such as those 
described in Figs. 8 and 9. The processes performed typically include translating a 
client request into a request understood by database query engine layer 1220 receiving 
30 data from database query engine layer 1220, and translating the received data into a 
format understood by the client. Database query engine layer 1220 typically performs 
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processes such as those described in Fig. 5. Database query engine layer 1220 is well 
understood in the art and not discussed in detail here. Database 1230 is typically a 
repository of data, consisting of one or more tables of data. Database 1230 can be one 
or more databases, contained on one or more computer systems or servers. 

5 Hierarchical interface layer 1210 provides a client with a hierarchical 

presentation of the data contained within the database. Hierarchical interface layer 
1210 allows the presentation of the hierarchical data to vary per client, such that the 
hierarchy is tailored to the needs of each client. The client therefore interacts with the 
database 1230 through hierarchical interface layer 1210, and thus the client does not 
1 0 need to have a database query engine. Hierarchical interface layer 1210 shields the 
client from query engine layer 1220 such that the client also does not need to 
understand query languages and classification parameters to access database 1230. 

While particular embodiments of the present invention have been shown and 
described, it will be obvious to those skilled in the art that changes and modifications 
1 5 can be made without departing from this invention in its broader aspects and, 

therefore, the appended claims are to encompass within their scope all such changes 
and modifications as fall within the true spirit and scope of this invention. 
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